Automatic classification of mammography reports by BI-RADS breast tissue composition class

نویسندگان

  • Bethany Percha
  • Houssam Nassif
  • Jafi A. Lipson
  • Elizabeth S. Burnside
  • Daniel L. Rubin
چکیده

Because breast tissue composition partially predicts breast cancer risk, classification of mammography reports by breast tissue composition is important from both a scientific and clinical perspective. A method is presented for using the unstructured text of mammography reports to classify them into BI-RADS breast tissue composition categories. An algorithm that uses regular expressions to automatically determine BI-RADS breast tissue composition classes for unstructured mammography reports was developed. The algorithm assigns each report to a single BI-RADS composition class: 'fatty', 'fibroglandular', 'heterogeneously dense', 'dense', or 'unspecified'. We evaluated its performance on mammography reports from two different institutions. The method achieves >99% classification accuracy on a test set of reports from the Marshfield Clinic (Wisconsin) and Stanford University. Since large-scale studies of breast cancer rely heavily on breast tissue composition information, this method could facilitate this research by helping mine large datasets to correlate breast composition with other covariates.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning Approaches to Automatic BI-RADS Classification of Mammography Reports

The average American radiologist interprets at least 1,777 mammogram reports each year, or approximately one new mammogram every 70 minutes [1]. Because radiologists interpret so many mammograms and because the proper interpretation of a screening mammogram is often a matter of life or death for the woman involved, various attempts have been made to streamline the mammography reporting process ...

متن کامل

Comparison of Danish dichotomous and BI-RADS classifications of mammographic density

BACKGROUND In the Copenhagen mammography screening program from 1991 to 2001, mammographic density was classified either as fatty or mixed/dense. This dichotomous mammographic density classification system is unique internationally, and has not been validated before. PURPOSE To compare the Danish dichotomous mammographic density classification system from 1991 to 2001 with the density BI-RADS...

متن کامل

Automatic evaluation of breast density in mammographic images

The goal of this master thesis is to develop a computerized method for automatic estimation of the mammographic density of mammographic images from 5 different types of mammography units. Mammographic density is a measurement of the amount of fibroglandular tissue in a breast. This is the single most attributable risk factor for breast cancer; an accurate measurement of the mammographic density...

متن کامل

BI-RADS 3: Current and Future Use of Probably Benign

Purpose of Review Probably benign (BI-RADS 3) causes confusion for interpreting physicians and referring physicians and can induce significant patient anxiety. The best uses and evidence for using this assessment category in mammography, breast ultrasound, and breast MRI will be reviewed; the reader will have a better understanding of how and when to use BI-RADS 3. Recent Findings Interobserv...

متن کامل

Automated detection of ambiguity in BI-RADS assessment categories in mammography reports.

An unsolved challenge in biomedical natural language processing (NLP) is detecting ambiguities in the reports that can help physicians to improve report clarity. Our goal was to develop NLP methods to tackle the challenges of identifying ambiguous descriptions of the laterality of BI-RADS Final Assessment Categories in mammography radiology reports. We developed a text processing system that us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of the American Medical Informatics Association : JAMIA

دوره 19 5  شماره 

صفحات  -

تاریخ انتشار 2012